RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: _ - 

Application Serial Number: 6°{ At ^ O j 9 ZS 
Source: ' OXPF. 

Date Processed by STIC: <3—(<C- Ot— 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308^1212. 
FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 

PATENTIN 2.1 e-ihail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

r 

, r 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://vyww.uspto.gov/ebc/efs/downloads/documents.htni> » EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U:S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7* Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 

Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, r other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 

Revised 01/29/2002 
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Raw Sequence Listing Error Summary , 



3ER: 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISRECARD ENCLtSII "ALPHA" HEADERS. WHICH WERE INSERTED DY FTO SOFTWARE 

Wrapped Aminos - was retrieved in a \ 

»• 



2. 
3 



8 



10 



11 



12 



Wrapped Nucleics The numberAexl at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos - was retrieved in a word processor after creating il Please adjust your right margin to .3; this will 

prevent "wrapping. 1 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

_Misaligned Amino The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

hJon-ASCII The submitted file was not saved in ASCII(IX)S) text, as required by the Sequence Rules. Please 

ensure your subsequent submission Is saved In ASCII text 



Variable Length Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xta can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 



Patentln 2.0 
* "bug" 



A "bug" in Patentln version 2.0 has caused the <220>-<223> sectidn to be missing from amino acid 

sequences^) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial oi* Unknown sequences. 

^Skipped Sequences Sequencers) missing. If intentional, please insert the following lines for each skipped sequence: 

"(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Skipped Sequences Sequencers) , missing. If Intentional, please insert the following lines for each skipped sequence. 
(NEW RULES) <2 10> sequence id number 

<400> sequence id number 

000 



Use of n's or Xaa's 
"(NEW RULES) 



.Invalid <213> 
Response 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown. Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 1 3> response is Unjcnown or 
is Artificial Sequence 



Use of <220> 



Sequences) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Patentln 2.0 
" "bug" 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(Sec "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function f Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" r any ther manual means t copy file to Jloppy disk. 



AMC - Biotechnology Systems Branch - 06704/2001 




# 



Does Not Comply 
Corrected Diskette Needed 



RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/940,925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 

SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 



6 (i) APPLICANT: BROW, MARY ANN D. 

7 LYAMICHEV, VICTOR I. 

8 OLIVE, DAVID M. 

10 (ii) TITLE OF INVENTION: RAPID DETECTION AND IDENTIFICATION OF 

11 PATHOGENS 
13 (iii) NUMBER OF SEQUENCES: 165 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: MEDLEN & CARROLL 

17 (B) STREET: 220 MONTGOMERY STREET, SUITE 2200 

18 (C) CITY: SAN FRANCISCO 

19 (D) STATE: CALIFORNIA 

20 (E) COUNTRY: UNITED STATES OF AMERICA 

21 (F) ZIP: 94104 

23 (V) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

27 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/940,925 

C--> 31 (B) FILING DATE: 28-Aug-2001 

32 (C) CLASSIFICATION: 

34 (viii) ATTORNEY/AGENT INFORMATION: 

3 5 (A) NAME: CARROLL, PETER G. 

36 (B) REGISTRATION NUMBER: 32,837 

37 (C) REFERENCE/DOCKET NUMBER: FORS-01756 

39 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: (415) 705-8410. 

41 (B) TELEFAX: (415) 397-8338 



ERRORED SEQUENCES 

44 (2) INFORMATION FOR SEQ ID NO: 1: 



46 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 2506 base pairs 

48 (B) TYPE: nucleic acid 

4 9 (C) STRANDEDNESS: double 

50 (D) TOPOLOGY: linear 

52 (ii) MOLECULE TYPE: DNA (genomic) 

56 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



file://C:\Crf3\Outhold\VsrI940925.htm 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/940,925 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 



E--> 58 ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT 

59 GGACGGCCAC 60 
E--> 61 CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG 

62 GGGGGAGCCG 120 
E—> 64 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA 

65 GGACGGGGAC 180 
E--> 67 GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC 

68 CTACGGGGGG 240 
E--> 70 TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC 

71 CCTCATCAAG 300 
E--> 73 GAGCTGGTGG ACCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA 

74 GGCGGACGAC 360 
E--> 76 GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG 

77 CATCCTCACC 420 
E--> 79 GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA 

80 CCCCGAGGGG 480 
E--> 82 TACCTCATCA CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA 

83 CCAGTGGGCC 540 
E--> 85 GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACCTTC CCGGGGTCAA 

86 GGGCATCGGG 600 
E--> 88 GAGAAGACGG CGAGGAAGCT TCTGGAGGAG TGGGGGAGCC TGGAAGCCCT 

89 CCTCAAGAAC 660 
E--> 91 CTGGACCGGC TGAAGCCCGC CATCCGGGAG AAGATCCTGG CCCACATGGA 

92 CGATCTGAAG 720 
E--> 94 CTCTCCTGGG ACCTGGCCAA GGTGCGCACC GACCTGCCCC TGGAGGTGGA 

95 CTTCGCCAAA 780 
E--> 97 AGGCGGGAGC CCGACCGGGA GAGGCTTAGG GCCTTTCTGG AGAGGCTTGA 

98 GTTTGGCAGC 840 
E--> 100 CTCCTCCACG AGTTCGGCCT TCTGGAAAGC CCCAAGGCCC TGGAGGAGGC 

101 CCCCTGGCCC 900 
E--> 103 CCGCCGGAAG GGGCCTTCGT GGGCTTTGTG CTTTCCCGCA AGGAGCCCAT 

104 GTGGGCCGAT 960 
E--> 106 CTTCTGGCCC TGGCCGCCGC CAGGGGGGGC CGGGTCCACC GGGCCCCCGA 

107 GCCTTATAAA 1020 
E--> 109 GCCCTCAGGG ACCTGAAGGA GGCGCGGGGG CTTCTCGCCA AAGACCTGAG 

110 CGTTCTGGCC 1080 
E--> 112 CTGAGGGAAG GCCTTGGCCT CCCGCCCGGC GACGACCCCA TGCTCCTCGC 

113 CTACCTCCTG . 1140 
E--> 115 GACCCTTCCA ACACCACCCC CGAGGGGGTG GCCCGGCGCT ACGGCGGGGA 

116 GTGGACGGAG 1200 
E--> 118 GAGGCGGGGG AGCGGGCCGC CCTTTCCGAG AGGCTCTTCG CCAACCTGTG 

119 GGGGAGGCTT 1260 
E--> 121 GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT 

122 TTCCGCTGTC 1320 
E--> 124 CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG 

125 GGCCTTGTCC 1380 
E--> 127 CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT 

128 GGCCGGCCAC 1440 
E--> 13 0 CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA 



DATE: 03/15/2002 
TIME: 14:41:44 




Je type of errors shown exist throughout 
»e Sequence Listing. Please check subsequc 
sequences for similar errors 



tu 
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RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/940,925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 

131 GCTAGGGCTT 1500 
E--> 133 CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC 

134 CGTCCTGGAG 1560 
E--> 136 GCCCTCCGCG AGGCCCACCC. CATC GTG GAG AAGATCCTGC AGTACCGGGA 

137 GCTCACCAAG 1620 
E--> 139 CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC 

140 GGGCCGCCTC 1680 
E--> 14 2 CACACCCGCT TCAACCAGAC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC 

14 3 CGATCCCAAC 1740 
E--> 14 5 CTCCAGAACA TCCCCGTCCG CACCCCGCTT GGGCAGAGGA TCCGCCGGGC 

14 6 CTTCATCGCC 1800 
E--> 14 8 GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG 

149 GGTGCTGGCC 1860 
E--> 151 CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA 

152 CATCCACACG 1920 
E--> 154 GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT 

155 GATGCGCCGG 1980 
E--> 157 GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG 

158 CCTCTCCCAG 2040 
E--> 160 GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT 

161 TCAGAGCTTC 2100 
E--> 163 CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG 

164 GGGGTACGTG 2160 
E--> 166 GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT 

167 GAAGAGCGTG 2220 
E--> 169 CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC 

170 CGCCGACCTC 2280 
E--> 172 ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC 

173 CAGGATGCTC 2340 
E--> 175 CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA 

176 GGCCGTGGCC 2400 
E--> 178 CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG CCGTGCCCCT 

179 GGAGGTGGAG 2460 
E--> 181 GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 
W--> 182 2506 

184 (2) INFORMATION FOR SEQ ID NO : 2: 

186 (i) SEQUENCE CHARACTERISTICS: 

187 (A) LENGTH: 2496 base pairs 

188 (B) TYPE: nucleic acid 

189 (C) STRANDEDNESS : double 

190 (D) TOPOLOGY: linear 
192 (ii) MOLECULE TYPE: DNA (genomic) 
196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

E--> 198 ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC TCCTGGTGGA 

199 CGGCCACCAC 60 
E--> 201 CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA CCAGCCGCGG 

202 CGAACCCGTT 120 
E--> 204 CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC TGAAGGAGGA 

205 CGGGGACGTG 180 



file://C:\Crf3\Outhold\VsrI940925.htm 



3/15/02 



Page 4 of 23 



RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/940, 925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 

E--> 207 GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC ACGAGGCCTA 

208 CGAGGCCTAC 240 
E--> 210 AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC AGCTGGCCCT 

211 CATCAAGGAG 300 
E--> 213 TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG GCTTTGAGGC 

214 GGACGACGTG 360 
E--> 216 CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG AGGTGCGCAT 

217 CCTCACTGCC 420 
E--> 219 GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA TCCTCCACCC 

220 TGAGGGGTAC 4 80 

E--> 222 CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC GCCCGGAGCA 

223 GTGGGTGGAC 540 
E--> 225 TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG GGGTGAAGGG 

226 CATCGGGGAG 600 
E--> 228 AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG AAAACCTCTT 

229 CCAGCACCTG 660 
E--> 231 GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG GCATGGAGGC 

232 CCTGGCCCTT 720 
E--> 234 TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG AGGTGGACTT 

235 CGGGAGGCGC 780 
E--> 237 CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC GGTTGGAGTT 

238 TGGAAGCCTC . 840 
E--> 240 CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG AGGAGGCCCC 

241 CTGGCCCCCT 900 
E--> 243 CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG AGCCCATGTG 

244 GGCCGAGCTT 960 
E--> 246 CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC 

247 CCTTAGGGGC 1020 
E--> 249 CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT 

250 TTTGGCCCTG 1080 
E--> 252 CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA 

2 53 CCTTCTGGAC 114 0 
E--> 255 CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG 

256 GACGGAGGAT 1200 
E--> 258 GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA 

259 GCGCCTTAAG 1260 
E--> 261 GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC 

262 CCGGGTGTTG 1320 
E--> 264 GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC 

265 CCTCTCCCTG 1380 
E--> 267 GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC 

268 CGGCCACCCC 1440 
E--> 270 TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT 

271 GGGCCTGCCT 1500 
E--> 273 GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT 

274 GCTGGAGGCC 1560 
E--> 276 CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT 

277 CACCAAGCTC 1620 
E--> 279 AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG 
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RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/940,925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 
Output Set: N:\CRF3\03152002\I940925.raw 

280 CCGGCTCCAC 1680 
E--> 282 ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA 

283 CCCCAACCTG 1740 
E--> 285 CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT 

286 CGTGGCCGAG 1800 
E--> 2 88 GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT 

289 CCTGGCCCAC 1860 
E--> 291 CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT 

292 CCACACCCAG 1920 
E--> 294 ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT 

295 GCGCCGGGCG 1980 
E--> 297 GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT 

298 CTCCGGGGAG 2040 
E--> 3 00 CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA 

301 GAGCTACCCC 2100 
E--> 3 03 AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG 

304 GTATGTGGAG 2160 
E--> 3 06 ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA 

307 GAGCGTGCGC 2220 
E--> 309 GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC 

310 CGACCTCATG 2280 
E--> 312 AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG 

313 GATGCTTTTG 2340 
E--> 315 CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG 

316 GGTAGCCGCT 2400 
E--> 318 TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA 

319 GGTGGAGGTG 2460 

321 GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 2496 
323 (2) INFORMATION FOR SEQ ID NO: 3: 

325 (i) SEQUENCE CHARACTERISTICS: 

326 (A) LENGTH: 2504 base pairs 

327 (B) TYPE: nucleic acid 

328 (C) STRANDEDNESS : double 

329 (D) TOPOLOGY: linear 

331 (ii) MOLECULE TYPE: DNA (genomic) 

335 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

E--> 3 37 ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT 

3 38 GGACGGCCAC 60 
E--> 340 CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACGAGCCG p 

341 GGGCGAACCG 120 
E--> 343 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA 

344 GGACGGGTAC 180 
E--> 346 AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA 

347 GGCCTACGAG 240 
E--> 349 GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT 

350 CGCCCTCATC 300 
E--> 352 AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA 

353 CGAGGCGGAC 360 
E--> 355 GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT 
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RAW SEQUENCE LISTING 
PATENT APPLICATION : US/09/940,925 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 



DATE: 03/15/2002 
TIME: 14:41:44 



356 GCGCATCCTC 420 
E--> 358 ACCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG TCGCCGTCCT 

359 CCACCCCGAG 480 
E--> 361 GGCCACCTCA TCACCCCGGA GTGGCTTTGG GAGAAGTACG GCCTCAGGCC 

362 GGAGCAGTGG 540 
E--> 364 GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT 

365 CAAGGGCATC 600 
E--> 367 GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA 

3 68 CCTCCTCAAG 660 
E--> 370 AACCTGGACC GGGTAAAGCC AGAAAACGTC CGGGAGAAGA TCAAGGCCCA 

371 CCTGGAAGAC 720 
E--> 373 CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC TCCCCCTGGA 

374 GGTGGACCTC 780 
E--> 376 GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG 

377 GCTGGAGTTC 840 
E--> 379 GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA 

380 GGAGGCCCCC 900 
E--> 3 82 TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA 

383 GCCCATGTGG 960 
E--> 3 85 GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG TGCACCGGGC 

3 86 AGCAGACCCC 1020 

E--> 3 88 TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC TCGCCAAGGA 

389 CCTCGCCGTC 1080 
E--> 3 91 TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG ACCCCATGCT 

392 CCTCGCCTAC 1140 
E--> 394 CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG 

395 GGGGGAGTGG 1200 
E--> 397 ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA 

398 CCTCCTTAAG 1260 
E--> 400 CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG AGGTGGAAAA 

401 GCCCCTCTCC 1320 
E--> 4 03 CGGGTCCTGG CCCACATGGA GGCCACCGGG GTACGGCTGG ACGTGGCCTA 

404 CCTTCAGGCC 1380 
E--> 4 06 CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG AGGAGGTCTT 

407 CCGCTTGGCG 1440 
E--> 4 09 GGCCACCCCT TCAACCTCAA CTCCCGGGAC CAGCTGGAAA GGGTGCTCTT 

410 TGACGAGCTT 1500 
E--> 412 AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG 

413 CGCCGCGGTG 1560 
E--> 415 CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA TCCTCCAGCA 

416 CCGGGAGCTC 1620 
E--> 418 ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC TCGTCCACCC 

419 GAGGACGGGC 1680 
E--> 4 21 CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG GGAGGCTTAG 

4 22 TAGCTCCGAC 1740 

E--> 4 24 CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATCCG 

425 CCGGGCCTTC 1800 
E--> 427 GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA 

428 GCTCCGCGTC 1860 



5^ 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/940 f 925 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 



DATE: 03/15/2002 
TIME: 14:41:44 



E--> 430 CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT TCCAGGAGGG 

431 GAAGGACATC 1920 
E--> 433 CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG AGGCCGTGGA 

434 CCCCCTGATG 1980 
E--> 436 CGCCGGGCGG CCAAGACGGT GAACTTCGGC GTCCTCTACG GCATGTCCGC 

437 CCATAGGCTC 2040 
E--> 439 TCCCAGGAGC TTGCCATCCC CTACGAGGAG GCGGTGGCCT TTATAGAGGC 

440 TACTTCCAAA 2100 
E--> 442 GCTTCCCCAA GGTGCGGGCC TGGATAGAAA AGACCCTGGA GGAGGGGAGG 

443 AAGCGGGGCT 2160 
E--> 445 ACGTGGAAAC CCTCTTCGGA AGAAGGCGCT ACGTGCCCGA CCTCAACGCC 

44 6 CGGGTGAAGA 2220 
E--> 44 8 GCGTCAGGGA GGCCGCGGAG CGCATGGCCT TCAACATGCC CGTCCAGGGC 

449 ACCGCCGCCG 2280 
E--> 451 ACCTCATGAA GCTCGCCATG GTGAAGCTCT TCCCCCGCCT CCGGGAGATG 

452 GGGGCCCGCA 23 40 
E--> 454 TGCTCCTCCA GGTCCACGAC GAGCTCCTCC TGGAGGCCCC CCAAGCGCGG 

455 GCCGAGGAGG 2400 
E--> 457 TGGCGGCTTT GGCCAAGGAG GCCATGGAGA AGGCCTATCC CCTCGCCGTG 

458 CCCCTGGAGG 2460 
E--> 460 TGGAGGTGGG GATGGGGGAG GACTGGCTTT CCGCCAAGGG TTAG 
W--> 461 2504 

977 (2) INFORMATION FOR SEQ ID NO : 7: 

979 (i) SEQUENCE CHARACTERISTICS: 

980 (A) LENGTH: 2502 base pairs 

981 (B) TYPE: nucleic acid 

982 (C) STRANDEDNESS : single 

983 (D) TOPOLOGY: linear 

985 (ii) MOLECULE TYPE: DNA (genomic) 

989 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 

--> 991 ATGNNGGCGA TGCTTCCCCT CTTTGAGCCC AAAGGCCGGG TCCTCCTGGT 

992 GGACGGCCAC 60 
--> 994 CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACCAGCCG 

995 GGGCGAACCG 120 
--> 997 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA 
998 GGACGGGGAC 180 

1000 NNGGCGGTGN TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA 

1001 GGCCTACGAG 240 
--> 1003 GCCTACAAGG CGGGCCGGGC CCCCACCCCG GAGGACTTTC CCCGGCAGCT 

1004 CGCCCTCATC 3 00 

--> 1006 AAGGAGCTGG TGGACCTCCT GGGGCTTGCG CGCCTCGAGG TCCCCGGCTA 

1007 CGAGGCGGAC 360 
--> 1009 GACGTNCTGG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT 

1010 GCGCATCCTC 420 
--> 1012 ACCGCCGACC GCGACCTCTA CCAGCTCCTT TCCGACCGCA TCGCCGTCCT 

1013 CCACCCCGAG 480 
--> 1015 GGGTACCTCA TCACCCCGGC GTGGCTTTGG GAGAAGTACG GCCTGAGGCC 

1016 GGAGCAGTGG 540 
--> 1018 GTGGACTACC GGGCCCTGGC GGGGGACCCC TCCGACAACC TCCCCGGGGT 



E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
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1019 
E--> 1021 

1022 
E--> 1024 

1025 
E--> 1027 

1028 
E--> 1030 

1031 
E--> 1033 

1034 
E--> 1036 

1037 
E--> 1039 

1040 
E--> 1042 

1043 
E--> 1045 

1046 
E--> 1048 

1049 
E--> 1051 

1052 
E--> 1054 

1055 
E--> 1057 

1058 
E--> 1060 

1061 
E--> 1063 

1064 
E--> 1066 

1067 
E--> 1069 
. 1070 
E--> 1072 

1073 
E--> 1075 

1076 
E--> 1078 

1079 
E--> 1081 

1082 
E--> 1084 

1085 
E--> 1087 

1088 
E--> 1090 

1091 



CAAGGGCATC 
GGGGAGAAGA 
CCTCCTCAAG 
AACCTGGACC 
GGANGACCTG 
ANGCTCTCCT 
GGACTTCGCC 
AAGNGGCGGG 
GGAGTTTGGC 
AGCCTCCTCC 
GGCCCCCTGG 
CCCCCGCCGG 
CATGTGGGCC 
GAGCTTCTGG 
AGACCCCTTT 
ANGGGCCTNA 
GGCCGTTTTG 
GCCCTGAGGG 
CGCCTACCTC 
CTGGACCCCT 
GGAGTGGACG 
GAG GANG CGG 
NNNGCAGCGC 
CTTGAGGGGG 
CCTTTCCCGG 
GTCCTGGCCC 
CCAGGCCCTN 
TCCCTGGAGG 
CCTGGCCGGC 
CACCCCTTCA 
CGAGCTNGGG 
CTTCCCGCCA 
CGCCGTGCTG 
GAGGCCCTNC 
GGAGCTCACC 
AAGCTCAAGA 
GACGGGCCGC 
CTCCACACCC 
CTCCGACCCC 
AACCTGCAGA 
GGCCTTCGTG 
GCCGAGGAGG 
CCGGGTCCTG 
GCCCACCTCT 
GGACATCCAC 
ACCCAGACCG 
CCTGATGCGC 
CGGGCGGCCA 
CCGCCTCTCC 



600 
CCGCCCNGAA 

660 
GGGTGAAGCC 

720 
GGGAGCTNTC 

780 
AGCCCGACCG 

840 
ACGAGTTCGG 

900 
AAGGGGCCTT 

960 
CCCTGGCCGC 

1020 
GGGACCTNAA 

1080 
AGGGCCTNGA 

1140 
CCAACACCAC 

1200 
GGGAGCGGGC 

1260 
AGGAGAGGCT 

1320 
ACATGGAGGC 

1380 
TGGCGGAGGA 

1440 
ACCTCAACTC 

1500 
TCGGCAAGAC 

1560 
GNGAGGCCCA 

1620 
ACACCTACAT 

1680 
GCTTCAACCA 

1740 
ACATCCCCGT 

1800 
GNTGGGTGTT 

1860 
CCGGGGACGA 

1920- 
CCAGCTGGAT 

1980 
AG AC CATC AA 

2040 



GCTCCTCNAG 
CGCCNTCCGG 
CCAGGTGCGC 
GGAGGGGCTT 
CCTCCTGGAG 
CGTGGGCTTT 
CGCCAGGGAG 
GGAGGTGCGG 
CCTCNTGCCC 
CCCCGAGGGG 
CCTCCTNTCC 
CCTTTGGCTT 
CACGGGGGTN 
GATCCGCCGC 
CCGGGACCAG 
GGAGAAGACN 
CCCCATCGTG 
NGACCCCCTG 
GACGGCCACG 
CCGCACCCCN 
GGTGGCCCTG 
GAACCTGATC 
GTTCGGCGTC 
CTTCGGGGTC 



GAGTGGGGGA GCCTGGAAAA 



GAGAAGATCC AGGCCCACAT 



ACCGACCTGC CCCTGGAGGT 



AGGGCCTTTC TGGAGAGGCT 



GGCCCCAAGG CCCTGGAGGA 



GTCCTTTCCC GCCCCGAGCC 



GGCCGGGTCC ACCGGGCACC 



GGNCTCCTCG CCAAGGACCT 



GGGGACGACC CCATGCTCCT 



GTGGCCCGGC GCTACGGGGG 



GAGAGGCTCT TCCNGAACCT 



TACCAGGAGG TGGAGAAGCC 



CGGCTGGACG TGGCCTACCT 



CTCGAGGAGG AGGTCTTCCG 



CTGGAAAGGG TGCTCTTTGA 



GGCAAGCGCT CCACCAGCGC 



GAGAAGATCC TGCAGTACCG 



CCNGNCCTCG TCCACCCCAG 



GCCACGGGCA GGCTTAGTAG 



CTGGGCCAGA GGATCCGCCG 



GACTATAGCC AGATAGAGCT 



CGGGTCTTCC AGGAGGGGAG 



CCCCCGGAGG CCGTGGACCC 



CTCTACGGCA TGTCCGCCCA 
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E--> 1093 CAGGAGCTTG CCATCCCCTA CGAGGAGGCG GTGGCCTTCA TTGAGCGCTA 

1094 CTTCCAGAGC 2100 
E--> 1096 TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG 

1097 GCGGGGGTAC 2160 
E--> 1099 GTGGAGACCC TCTTCGGCCG CCGGCGCTAC GTGCCCGACC TCAACGCCCG 

1100 GGTGAAGAGC 2220 
E--> 1102 GTGCGGGAGG CGGCGGAGCG CATGGCCTTC AACATGCCCG TCCAGGGCAC 

1103 CGCCGCCGAC 2280 
E--> 1105 CTCATGAAGC TGGCCATGGT GAAGCTCTTC CCCCGGCTNC AGGAAATGGG 

1106 GGCCAGGATG 2340 
E--> 1108 CTCCTNCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCCA AAGAGCGGGC 



1109 GGAGGNGGTG 2400 
E--> 1111 GCCGCTTTGG CCAAGGAGGT CATGGAGGGG GTCTATCCCC TGGCCGTGCC 

1112 CCTGGAGGTG 2460 
E--> 1114 GAGGTGGGGA TGGGGGAGGA CTGGCTCTCC GCCAAGGAGT AG 
W--> 1115 2502 

1290 (2) INFORMATION FOR SEQ ID NO : 9: 

1292 (i) SEQUENCE CHARACTERISTICS: 

1293 (A) LENGTH: 1647 base pairs 

1294 (B) TYPE: nucleic acid 

1295 (C) STRANDEDNESS : double 

1296 (D) TOPOLOGY: linear 

1298 (ii) MOLECULE TYPE: DNA (genomic) 

1302 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

E--> 13 04 ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT 

13 05 GGTGGACGGC 60 
E--> 1307 CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG 

1308 CCGGGGGGAG 120 
E--> 1310 CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA 

1311 GGAGGACGGG 180 
E--> 1313 GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA 

1314 GGCCTACGGG 240 
E--> 1316 GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT 

1317 CGCCCTCATC 300 
E--> 1319 AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA 

1320 CGAGGCGGAC 360 
E--> 1322 GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT 

1323 CCGCATCCTC 420 
E--> 1325 ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT 

1326 CCACCCCGAG 480 
E--> 1328 GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC 

1329 CGACCAGTGG 540 
E--> 1331 GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT 

1332 CAAGGGCATC 600 
E--> 1334 GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC 

13 35 CCTCCTCAAG 660 
E--> 1337 AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT 

1338 GGACGATCTG 720 
E--> 1340 AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT 
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1341 
E--> 1343 

1344 
E--> 1346 

1347 
E--> 1349 

1350 
E--> 1352 

1353 
E--> 1355 

1356 
E--> 1358 

1359 
E--> 1361 

1362 
E--> 1364 

1365 
E--> 1367 

1368 
E--> 1370 

1371 
E--> 1373 

1374 
E--> 1376 

1377 
E--> 1379 

1380 
E--> 1382 

1383 

1385 

1387 

1389 

1390 

1391 

1392 

1393 

1395 

1399 
E--> 1401 

1402 
E--> 1404 

1405 
E--> 1407 

1408 
E--> 1410 

1411 
E--> 1413 

1414 
E--> 1416 



GGACTTCGCC 780 

AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT 
TGAGTTTGGC 840 

AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA 

GGCCCCCTGG 900 

CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC 

CATGTGGGCC 960 

GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC 
CGAGCCTTAT 1020 

AAAGCCCTCA GGGACCTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT 
GAGCGTTCTG 1080 

GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT 
CGCCTACCTC 1140 

CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG 
GGAGTGGACG 1200 

GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT 
GTGGGGGAGG 1260 

CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC 
CCTTTCCGCT 1320 

GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT 
CAGGGCCTTG 1380 

TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG 
CCTGGCCGGC 1440 

CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA 

CGAGCTAGGG 1500 

CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC 
CGCCGTCCTG 1560 

GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGGCATG 
CAAGCTTGGC 1620 
ACTGGCCGTC GTTTTACAAC GTCGTGA 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT 

GGTGGACGGC 60 

CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG 
CCGGGGGGAG 120 

CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA 

GGAGGACGGG 180 

GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA 
GGCCTACGGG 240 

GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT 

CGCCCTCATC 300 

AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA 



1647 
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1417 
E--> 1419 

1420 
E--> 1422 

1423 
E--> 1425 

1426 
E--> 1428 

1429 
E--> 1431 

1432 
E--> 1434 

1435 
E--> 1437 

1438 
E--> 1440 

1441 
E--> 1443 

1444 
E--> 1446 

1447 
E--> 1449 

1450 
E--> 1452 

1453 
E--> 1455 

1456 
E--> 1458 

1459 
E--> 1461 

1462 
E--> 1464 

1465 
E--> 1467 

1468 
E--> 1470 

1471 
E--> 1473 

1474 
E--> 1476 

1477 
E--> 1479 

1480 
E--> 1482 

1483 
E--> 1485 

1486 
E--> 1488 

1489 



CGAGGCGGAC 
GACGTCCTGG 
CCGCATCCTC 
ACCGCCGACA 
CCACCCCGAG 
GGGTACCTCA 
CGACCAGTGG 
GCCGACTACC 
CAAGGGCATC 
GGGGAGAAGA 
CCTCCTCAAG 
AACCTGGACC 
GGACGATCTG 
AAGCTCTCCT 
GGACTTCGCC 
AAAAGGCGGG 
TGAGTTTGGC 
AGCCTCCTCC 
GGCCCCCTGG 
CCCCCGCCGG 
CATGTGGGCC 
GATCTTCTGG 
CGAGCCTTAT 
AAAGCCCTCA 
GAGCGTTCTG 
GCCCTGAGGG 
CGCCTACCTC 
CTGGACCCTT 
GGAGTGGACG 
GAGGAGGCGG 
GTGGGGGAGG 
CTTGAGGGGG 
CCTTTCCGCT 
GTCCTGGCCC 
CAGGGCCTTG 
TCCCTGGAGG 
CCTGGCCGGC 
CACCCCTTCA 
CGAGCTAGGG 
CTTCCCGCCA 
CGCCGTCCTG 
GAGGCCCTCC 
GGAGCTCACC 
AAGCTGAAGA 
GACGGGCCGC 
CTCCACACCC 
CTCCGATCCC 
AACCTCCAGA 
GGCCTTCATC 



360 
CCAGCCTGGC 

420 
AAGACCTTTA 

480 
TCACCCCGGC 

540 
GGGCCCTGAC 

600 
CGGCGAGGAA 

660 
GGCTGAAGCC 

720 
GGGACCTGGC 

780 
AGCCCGACCG 

840 
ACGAGTTCGG 

900 
AAGGGGCCTT 

960 
CCCTGGCCGC 

1020 
GGGACCTGAA 

1080 
AAGGCCTTGG 

1140 
CCAACACCAC 

1200 
GGGAGCGGGC 

1260 
AGGAGAGGCT 

1320 
ACATGGAGGC 

1380 
TGGCCGGGGA 

1440 
ACCTCAACTC 

1500 
TCGGCAAGAC 

1560 
GCGAGGCCCA 

1620 
GCACCTACAT 

1680 
GCTTCAACCA 

1740 
ACATCCCCGT 
1800 



CAAGAAGGCG GAAAAGGAGG GCTACGAGGT 



CCAGCTCCTT TCCGACCGCA TCCACGTCCT 



CTGGCTTTGG GAAAAGTACG GCCTGAGGCC 



CGGGGACGAG TCCGACAACC TTCCCGGGGT 



GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC 



CGCCATCCGG GAGAAGATCC TGGCCCACAT 



CAAGGTGCGC ACCGACCTGC CCCTGGAGGT 



GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT 



CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA 



CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC 



CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC 



GGAGGCGCGG GGGCTTCTCG CCAAAGACCT 



CCTCCCGCCC GGCGACGACC CCATGCTCCT 



CCCCGAGGGG GTGGCCCGGC GCTACGGCGG 



CGCCCTTTCC GAGAGGCTCT TCGCCAACCT 



CCTTTGGCTT TACCGGGAGG TGGAGAGGCC 



CACGGGGGTG CGCCTGGACG TGGCCTATCT 



GATCGCCCGC CTCGAGGCCG AGGTCTTCCG 



CCGGGACCAG CTGGAAAGGG TCCTCTTTGA 



GGAGAAGACC GGCAAGCGCT CCACCAGCGC 



CCCCATCGTG GAGAAGATCC TGCAGTACCG 



TGACCCCTTG CCGGACCTCA TCCACCCCAG 



GACGGCCACG GCCACGGGCA GGCTAAGTAG 



CCGCACCCCG CTTGGGCAGA GGATCCGCCG 
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E--> 14 91 GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AG AT AG AG CT 

14 92 CAGGGTGCTG 1860 
E--> 14 94 GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG 

14 95 GGACATCCAC 1920 
E--> 14 97 ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC 

1498 CCTGATGCGC 1980 
E--> 1500 CGGGCGGCCA AG AC CATC AA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA 

1501 CCGCCTCTCC 2040 
E--> 1503 CAGGAGCTAG CTAGCCATCC CTTACGAGGA GGCCCAGGCC TTCATTGA 
W--> 1504 2088 

1506 (2) INFORMATION FOR SEQ ID NO: 11: 



1508 (i) SEQUENCE CHARACTERISTICS: 

1509 (A) LENGTH: 962 base pairs 

1510 (B) TYPE: nucleic acid 

1511 (C) STRANDEDNESS: double 

1512 (D) TOPOLOGY: linear 

1514 (ii) MOLECULE TYPE: DNA (genomic) 

1518 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



E--> 1520 ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT 

1521 GGTGGACGGC 60 
E--> 1523 CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG 

1524 CCGGGGGGAG 120 
E--> 1526 CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA 

1527 GGAGGACGGG 180 
E--> 1529 GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA 

1530 GGCCTACGGG 240 
E--> 1532 GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT 

1533 CGCCCTCATC 300 
E--> 1535 AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA 

1536 CGAGGCGGAC 360 
E--> 1538 GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT 

1539 CCGCATCCTC 420 
E--> 1541 ACCGCCGACA AAGACCTTTA CCAGCTTCTT TCCGACCGCA TCCACGTCCT 

1542 CCACCCCGAG 480 
E--> 1544 GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC 

1545 CGACCAGTGG 540 
E--> 1547 GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT 

1548 CAAGGGCATC 600 
E--> 1550 GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC 

1551 CCTCCTCAAG 660 
E--> 1553 AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT 

1554 GGACGATCTG 720 
E--> 1556 AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT 

1557 GGACTTCGCC 780 
E--> 1559 AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT 

1560 TGAGTTTGGC 840 
E--> 1562 AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGT CATGGAGGGG 

1563 GTGTATCCCC 900 
E--> 1565 TGGCCGTGCC CCTGGAGGTG GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC 
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1566 GCCAAGGAGT 960 
1568 GA 

1570 (2) INFORMATION FOR SEQ ID NO: 12: 

1572 (i) SEQUENCE CHARACTERISTICS: 

1573 (A) LENGTH: 1600 base pairs 

1574 (B) TYPE: nucleic acid 

1575 (C) STRANDEDNESS : double 

1576 (D) TOPOLOGY: linear 

1578 (ii) MOLECULE TYPE: DNA (genomic) 

1582 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

E--> 1584 ATGGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAAGGGC CGGGTCCTCC 

1585 TGGTGGACGG -60 
E--> 1587 CCACCACCTG GCCTACCGCA CCTTCCACGC CCTGAAGGGC CTCACCACCA 

1588 GCCGGGGGGA 120 
E--> 1590 GCCGGTGCAG GCGGTCTACG GCTTCGCCAA GAGCCTCCTC AAGGCCCTCA 

1591 AGGAGGACGG 180 
E--> 1593 GGACGCGGTG ATCGTGGTCT TTGACGCCAA GGCCCCCTCC TTCCGCCACG 

1594 AGGCCTACGG 240 
E--> 1596 GGGGTACAAG GCGGGCCGGG CCCCCACGCC GGAGGACTTT CCCCGGCAAC 

1597 TCGCCCTCAT 300 
E--> 1599 CAAGGAGCTG GTGGACCTCC TGGGGCTGGC GCGCCTCGAG GTCCCGGGCT 

1600 ACGAGGCGGA 360 
E--> 1602 CGACGTCCTG GCCAGCCTGG CCAAGAAGGC GGAAAAGGAG GGCTACGAGG 

1603 TCCGCATCCT 420 
E--> 1605 CACCGCCGAC AAAGACCTTT ACCAGCTCCT TTCCGACCGC ATCCACGTCC 

1606 TCCACCCCGA 480 
E--> 1608 GGGGTACCTC ATCACCCCGG CCTGGCTTTG GGAAAAGTAC GGCCTGAGGC 

1609 CCGACCAGTG 540 
E--> 1611 GGCCGACTAC CGGGCCCTGA CCGGGGACGA GTCCGACAAC CTTCCCGGGG 

1612 TCAAGGGCAT 600 
E--> 1614 CGGGGAGAAG ACGGCGAGGA AGCTTCTGGA GGAGTGGGGG AGCCTGGAAG 

1615 CCCTCCTCAA 660 
E--> 1617 GAACCTGGAC CGGCTGAAGC CCGCCATCCG GGAGAAGATC CTGGCCCACA 

1618 TGGACGATCT 720 
E--> 1620 GAAGCTCTCC TGGGACCTGG CCAAGGTGCG CACCGACCTG CCCCTGGAGG 

. 1621 TGGACTTCGC 780 
E--> 1623 CAAAAGGCGG GAGCCCGACC GGGAGAGGCT TAGGGCCTTT CTGGAGAGGC 

1624 TTGAGTTTGG 840 
E--> 1626 GAGCCTCCTC CACGAGTTCG GCCTTCTGGA AAGCCCCAAG ATCCGCCGGG 

1627 CCTTCATCGC 900 
E--> 1629 CGAGGAGGGG TGGCTATTGG TGGCCCTGGA CTATAGCCAG ATAGAGCTCA 

1630 GGGTGCTGGC 960 
E--> 1632 CCACCTCTCC GGCGACGAGA ACCTGATCCG GGTCTTCCAG GAGGGGCGGG 

1633 ACATCCACAC 1020 
E--> 1635 GGAGACCGCC AGCTGGATGT TCGGCGTCCC CCGGGAGGCC GTGGACCCCC 

163 6 TGATGCGCCG 1080 
E--> 1638 GGCGGCCAAG ACCATCAACT TCGGGGTCCT CTACGGCATG TCGGCCCACC 

1639 GCCTCTCCCA 1140 
E--> 1641 GGAGCTAGCC ATCCCTTACG AGGAGGCCCA GGCCTTCATT GAG CGC TACT 
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PATENT APPLICATION: US/09/940,925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 
Output Set: N:\CRF3\03152002\I940925.raw 

1642 TTCAGAGCTT 1200 
E--> 1644 CCCCAAGGTG CGGGCCTGGA TTGAGAAGAC CCTGGAGGAG GGCAGGAGGC 

1645 GGGGGTACGT 1260 
E--> 1647 GGAGACCCTC TTCGGCCGCC GCCGCTACGT GCCAGACCTA GAGGCCCGGG 

1648 TGAAGAGCGT 1320 
E--> 1650 GCGGGAGGCG GCCGAGCGCA TGGCCTTCAA CATGCCCGTC CGGGGCACCG 

1651 CCGCCGACCT 1380 
E--> 1653 CATGAAGCTG GCTATGGTGA AGCTCTTCCC CAGGCTGGAG GAAATGGGGG 

1654 CCAGGATGCT 1440 
E--> 1656 CCTTCAGGTC CACGACGAGC TGGTCCTCGA GGCCCCAAAA GAGAGGGCGG 

1657 AGGCCGTGGC 1500 
E--> 1659 CCGGCTGGCC AAGGAGGTCA TGGAGGGGGT GTATCCCCTG GCCGTGCCCC 

1660 TGGAGGTGGA 1560 
E--> 1662 GGTGGGGATA GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 
W--> 1663 1600 

1697 (2) INFORMATION FOR SEQ ID NO: 15: 

1699 (i) SEQUENCE CHARACTERISTICS: 

1700 (A) LENGTH: 91 base pairs 

1701 (B) TYPE: nucleic acid 

1702 (C) STRANDEDNESS : single 

1703 (D) TOPOLOGY: linear 

1705 (ii) MOLECULE TYPE: DNA (genomic) 

1709 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

E--> 1711 TAATACGACT CACTATAGGG AGACCGGAAT TCGAGCTCGC CCGGGCGAGC 
1712 TCGAATTCCG 60 

1714 TGTATTCTAT AGTGTCACCT AAATCGAATT C 91 
1764 (2) INFORMATION FOR SEQ ID NO: 19: 

1766 (i) SEQUENCE CHARACTERISTICS: 

1767 (A) LENGTH: 42 base pairs 

1768 (B) TYPE: nucleic acid 

1769 (C) STRANDEDNESS: single 

1770 (D) TOPOLOGY: linear 

1772 (ii) MOLECULE TYPE: DNA (genomic) 

1776 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

E--> 1778 GGATCCTCTA GAGTCGACCT GCAGGCATGC CTACCTTGGT AG 
W--> 1779 42 

1797 (2) INFORMATION FOR SEQ ID NO: 21: 

1799 (i) SEQUENCE CHARACTERISTICS: 

1800 (A) LENGTH: 2502 base pairs 

1801 (B) TYPE: nucleic acid 

1802 (C) STRANDEDNESS: double 

1803 (D) TOPOLOGY: linear 

1805 (ii) MOLECULE TYPE: DNA (genomic) 

1809 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

E--> 1811 ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT 

1812 GGTGGACGGC 60 
E--> 1814 CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG 

1815 CCGGGGGGAG 120 
E--> 1817 CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA 
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1818 GGAGGACGGG 180 
E--> 1820 GACGCGGTGA TCGTGGTCTT 

1821 GGCCTACGGG 240 
E--> 1823 GGGTACAAGG CGGGCCGGGC 

1824 CGCCCTCATC 300 
E--> 1826 AAGGAGCTGG TGGACCTCCT 

1827 CGAGGCGGAC 360 
E--> 1829 GACGTCCTGG CCAGCCTGGC 

1830 CCGCATCCTC 420 
E--> 1832 ACCGCCGACA AAGACCTTTA 

1833 CCACCCCGAG 480 
E--> 1835 GGGTACCTCA TCACCCCGGC 

1836 CGACCAGTGG 540 
E--> 1838 GCCGACTACC GGGCCCTGAC 

183 9 CAAGGGCATC 600 
E--> 1841 GGGGAGAAGA CGGCGAGGAA 

184 2 CCTCCTCAAG 660 
E--> 1844 AACCTGGACC GGCTGAAGCC 

184 5 GGACGATCTG 720 
E--> 1847 AAGCTCTCCT GGGACCTGGC 

184 8 GGACTTCGCC 780 
E--> 18 50 AAAAGGCGGG AGCCCGACCG 

1851 TGAGTTTGGC 840 
E--> 1853 AGCCTCCTCC ACGAGTTCGG 

1854 GGCCCCCTGG 900 
E--> 1856 CCCCCGCCGG AAGGGGCCTT 

1857 CATGTGGGCC 960 
E--> 1859 GATCTTCTGG CCCTGGCCGC 

1860 CGAGCCTTAT 1020 
E--> 1862 AAAGCCCTCA GGGACCTGAA 

1863 GAGCGTTCTG 1080 
E--> 1865 GCCCTGAGGG AAGGCCTTGG 

1866 CGCCTACCTC 1140 
E--> 1868 CTGGACCCTT CCAACACCAC 

1869 GGAGTGGACG 1200 
E--> 1871 GAGGAGGCGG GGGAGCGGGC 

1872 GTGGGGGAGG 1260 
E--> 1874 CTTGAGGGGG AGGAGAGGCT 

1875 CCTTTCCGCT 1320 
E--> 1877 GTCCTGGCCC ACATGGAGGC 

1878 CAGGGCCTTG 1380 
E--> 1880 TCCCTGGAGG TGGCCGGGGA 

1881 CCTGGCCGGC 1440 
E--> 1883 CACCCCTTCA ACCTCAACTC 

1884 CGAGCTAGGG 1500 
E--> 1886 CTTCCCGCCA TCGGCAAGAC 

1887 CGCCGTCCTG 1560 
E--> 1889 GAGGCCCTCC GCGAGGCCCA 

1890 GGAGCTCACC 1620 



A:\Seq-sub. app 

N:\CRF3\03152002\I94 0925.raw 

TGACGCCAAG GCCCCCTCCT TCCGCCACGA 
CCCCACGCCG GAGGACTTTC CCCGGCAACT 
GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA 
CAAGAAGGCG GAAAAGGAGG GCTACGAGGT 
CCAGCTCCTT TCCGACCGCA TCCACGTCCT 
CTGGCTTTGG GAAAAGTACG GCCTGAGGCC 
CGGGGACGAG TCCGACAACC TTCCCGGGGT 
GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC 
CGCCATCCGG GAGAAGATCC TGGCCCACAT 
CAAGGTGCGC ACCGACCTGC CCCTGGAGGT 
GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT 
CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA 
CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC 
CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC 
GGAGGCGCGG GGGCTTCTCG CCAAAGACCT 
CCTCCCGCCC GGCGACGACC CCATGCTCCT 
CCCCGAGGGG GTGGCCCGGC GCTACGGCGG 
CGCCCTTTCC GAGAGGCTCT TCGCCAACCT 

* 

CCTTTGGCTT TACCGGGAGG TGGAGAGGCC 
CACGGGGGTG CGCCTGGACG TGGCCTATCT 
GATCGCCCGC CTCGAGGCCG AGGTCTTCCG 
CCGGGACCAG CTGGAAAGGG TCCTCTTTGA 
GGAGAAGACC GGCAAGCGCT CCACCAGCGC 
CCCCATCGTG GAGAAGATCC TGCAGTACCG 
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E--> 1892 AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG 

1893 GACGGGCCGC 1680 
E--> 1895 CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGCTAAGTAG 

1896 CTCCGATCCC 1740 
E--> 1898 AACCTCCAGA ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG 

1899 GGCCTTCATC 1800 
E--> 1901 GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AG AT AG AG CT 

1902 CAGGGTGCTG 1860 
E--> 1904 GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG 

1905 GGACATCCAC 1920 
E--> 1907 ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC 

1908 CCTGATGCGC 1980 
E--> 1910 CGGGCGGCCA AG AC CATC AA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA 

1911 CCGCCTCTCC 2040 
E--> 1913 CAGGAGCTAG CCATCCCTTA CGAGGAGGCC CAGGCCTTCA TTGAGCGCTA 

1914 CTTTCAGAGC 2100 
E--> 1916 TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG AGGGCAGGAG 

1917 GCGGGGGTAC 2160 
E--> 1919 GTGGAGACCC TCTTCGGCCG CCGCCGCTAC GTGCCAGACC TAGAGGCCCG 

1920 GGTGAAGAGC 2220 
E--> 1922 GTGCGGGAGG CGGCCGAGCG CATGGCCTTC AACATGCCCG TCCGGGGCAC 

1923 CGCCGCCGAC 2280 
E--> 1925 CTCATGAAGC TGGCTATGGT GAAGCTCTTC CCCAGGCTGG AGGAAATGGG 

1926 GGCCAGGATG 2340 
E--> 1928 CTCCTTCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCAA AAGAGAGGGC 

1929 GGAGGCCGTG 2400 
E--> 1931 GCCCGGCTGG CCAAGGAGGT CATGGAGGGG GTGTATCCCC TGGCCGTGCC 

1932 CCTGGAGGTG 2460 
E~-> 1934 GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT GA 
W--> 1935 2502 

1953 (2) INFORMATION FOR SEQ ID NO: 23: 

1955 (i) SEQUENCE CHARACTERISTICS: 

1956 (A) LENGTH: 72 base pairs 

1957 (B) TYPE: nucleic acid 

1958 (C) STRANDEDNESS : single 

1959 (D) TOPOLOGY: linear 

1961 (ii) MOLECULE TYPE: DNA (genomic) 

1965 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

E--> 1967 CGGACGAACA AG CG AG AC AG CGACACAGGT ACCACATGGT ACAAGAGGCA 
1968 AGAGAGACGA 60 
1970 CACAGCAGAA AC 

1972 (2) INFORMATION FOR SEQ ID NO: 24: 

1974 (i) SEQUENCE CHARACTERISTICS: 

1975 (A) LENGTH: 70 base pairs 

1976 (B) TYPE: nucleic acid 

1977 (C) STRANDEDNESS: single 

1978 (D) TOPOLOGY: linear 

1980 (ii) MOLECULE TYPE: DNA (genomic) 

1984 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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E--> 1986 GTTTCTGCTG TGTCGTCTCT CTTGCCTCTT GTACCATGTG GTACCTGTGT 

1987 CGCTGTCTCG 60 

1989 CTTGTTCGTC 70 
2023 (2) INFORMATION FOR SEQ ID NO: 27: 

2025 (i) SEQUENCE CHARACTERISTICS: 

2026 (A) LENGTH: 46 base pairs 

2027 (B) TYPE: nucleic acid 

2028 (C) STRANDEDNESS: single 

2029 (D) TOPOLOGY: linear 

2031 (ii) MOLECULE TYPE: DNA (genomic) 

2035 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

E--> 2037 CCTCTTGTAC CATGTGGTAC CTGTGTCGCT GTCTCGCTTG TTCGTC 
W--> 2038 46 

2040 (2) INFORMATION FOR SEQ ID NO: 28: 

2042 (i) SEQUENCE CHARACTERISTICS: 

2043 (A) LENGTH: 50 base pairs 

2044 (B) TYPE: nucleic acid 

2045 (C) STRANDEDNESS: single 

2046 (D) TOPOLOGY: linear 

2048 (ii) MOLECULE TYPE: DNA (genomic) 

2052 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

E--> 2054 AC AC AG GT AC CACATGGTAC AAGAGGCAAG AGAGACGACA CAGCAGAAAC 
W--> 2055 50 

2074 (2) INFORMATION FOR SEQ ID NO: 30: 

2076 (i) SEQUENCE CHARACTERISTICS: 

2077 (A) LENGTH: 969 base pairs 

2078 (B) TYPE: nucleic acid 

2079 (C) STRANDEDNESS: double 

2080 *(D) TOPOLOGY: linear 

2082 (ii) MOLECULE TYPE: DNA (genomic) 

2086 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

E--> 2088 ATGGCTAGCA TGACTGGTGG AC AG C AAATG GGTCGGATCA ATTCGGGGAT 

2089 GCTGCCCCTC 60 
E--> 2091 TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA 

2092 CCGCACCTTC 120 
E--> 2094 CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT 

2095 CTACGGCTTC 180 
E--> 2097 GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT 

2098 GGTCTTTGAC 240 
E--> 2100 GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG 

2101 CCGGGCCCCC 300 
E--> 2103 ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA 

2104 CCTCCTGGGG 360 
E--> 2106 CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG 

2107 CCTGGCCAAG 420 
E--> 2109 AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA 

2110 CCTTTACCAG 480 
E--> 2112 CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC 

2113 CCCGGCCTGG 540 



file://C:\Crf3\Outhold\VsrI940925.htm 



3/15/02 




Page 18 of 23 



RAW SEQUENCE LISTING DATE: 03/15/2002 

PATENT APPLICATION: US/09/94 0 , 925 TIME: 14:41:44 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 

E--> 2115 CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC 

2116 CCTGACCGGG 600 
E--> 2118 GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC 

2119 GAGGAAGCTT 660 
E--> 2121 CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT 

2122 GAAGCCCGCC 720 
E--> 2124 ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA 

2125 CCTGGCCAAG 780 
E--> 2127 GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC 

2128 CGACCGGGAG 840 
E--> 2130 AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA 

2131 GTTCGGCCTT 900 
E--> 2133 CTGGAAAGCC CCAAGTCATG GAGGGGGTGT ATCCCCTGGC CGTGCCCCTG 

2134 GAGGTGGAGG 960 

2136 TGGGGATAG 

2138 (2) INFORMATION FOR SEQ ID NO: 31: 

2140 (i) SEQUENCE CHARACTERISTICS: 

2141 (A) LENGTH: 948 base pairs 

2142 (B) TYPE: nucleic acid 

2143 (C) STRANDEDNESS :. double 

2144 (D) TOPOLOGY: linear 

2146 (ii) MOLECULE TYPE: DNA (genomic) 

2150 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

E--> 2152 ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT 

2153 GCTGCCCCTC 60 
E--> 2155 TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA 

2156 CCGCACCTTC 120 
E--> 2158 CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT 

2159 CTACGGCTTC 180 
E--> 2161 GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT 

2162 GGTCTTTGAC 240 
E--> 2164 GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG 

2165 CCGGGCCCCC 300 
E--> 2167 ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA 

2168 CCTCCTGGGG .360 
E--> 2170 CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG 

2171 CCTGGCCAAG 420 
E--> 2173 AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA 

2174 CCTTTACCAG 480 
E--> 2176 CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC 

2177 CCCGGCCTGG 540 
E--> 2179 CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC 

2180 CCTGACCGGG 600 
E--> 2182 GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC 

2183 GAGGAAGCTT 660 
E--> 2185 CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT 

2186 GAAGCCCGCC 720 
E--> 2188 ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC TCTCCTGGGA 

2189 CCTGGCCAAG 7 80 
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E--> 2191 GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC 

2192 CGACCGGGAG 840 
E--> 2194 AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA 

2195 GTTCGGCCTT 900 
E--> 2197 CTGGAAAGCC CCAAGGCCGC ACTCGAGCAC CACCACCACC ACCACTGA 
W--> 2198 948 

2200 (2) INFORMATION FOR SEQ ID NO: 32: 

2202 (i) SEQUENCE CHARACTERISTICS: 

2203 (A) LENGTH: 206 base pairs . 

2204 (B) TYPE: nucleic acid / 

2205 (C) STRANDEDNESS: double 

2206 (D) TOPOLOGY: linear 
2208 (ii) MOLECULE TYPE: DNA (genomic) 
2212 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

E--> 2214 CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG 

2215 TAATACGACT 60 
E--> 2217 CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC 

2218 CTGCAGGCAT 120 
E--> 2220 GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG 

2221 GTCATAGCTG 180 

2223 TTTCCTGTGT GAAATTGTTA TCCGCT 206 
2289 (2) INFORMATION FOR SEQ ID NO: 37: 

2291 (i) SEQUENCE CHARACTERISTICS: 

2292 (A) LENGTH: 43 base pairs 

2293 (B) TYPE: nucleic acid 

2294 (C) STRANDEDNESS: single 

2295 (D) TOPOLOGY: linear 

2297 (ii) MOLECULE TYPE: DNA (genomic) 

2301 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

E--> 2303 GACGAACAAG CGAGACAGCG AC C AG AG AG C AGAGAACCCA GAA 
W--> 2304 43 

2338 (2) INFORMATION FOR SEQ ID NO: 40: 

234 0 (i) SEQUENCE CHARACTERISTICS: 

2341 (A) LENGTH: 157 base pairs 

2342 (B) TYPE: nucleic acid 

2343 (C) STRANDEDNESS: double 

2344 (D) TOPOLOGY: linear 

2346 (ii) MOLECULE TYPE: DNA (genomic) 

2350 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

E--> 2352 CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA 

2353 TAACCGGGAA 60 
E--> 2355 TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT 

2356 TTCATCCAAA 120 

2358 GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 157 
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30 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 

31 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
58 M:254 E: No. of Bases conflict, lnput:0 Counted:50 SEQ:1 

254 Repeated in SeqNo=l 

182 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
198 M:254 E: No. of Bases conflict, lnput:0 Counted:50 SEQ:2 
254 Repeated in SeqNo=2 

337 M:254 E: No. of Bases conflict, Input :0 Counted: 50 SEQ : 3 
254 Repeated in SeqNo=3 

461 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
991 M:254 E: No. of Bases conflict, lnput:0 Counted:50 SEQ:7 
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1584 M:254 E: No. of Bases conflict, lnput:0 Counted:50 SEQ:12 
254 Repeated in SeqNo=12 

Invalid Amino Acid Number in Coding Region, SEQ ID 
No. of Bases conflict, Input :0 Counted: 50 SEQ: 15 
No. of Bases conflict, Input :0 Counted: 42 SEQ: 19 
Invalid Amino Acid Number in Coding Region, SEQ ID 
No. of Bases conflict, Input :0 Counted: 50 SEQ: 21 



1663 M: 336 W 

1711 M:254 E 

1778 M:254 E 

1779 M:336 W 
1811 M:254 E 



12 



19 
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VERIFICATION SUMMARY 
PATENT APPLICATION: US/09/94 0,925 

Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 



DATE: 03/15/2002 
TIME: 14:41:46 



M:254 Repeated 
L:1935 M:336 W 
L:1967 M:254 E 
L:1986 M:254 E 
L:2037 M:254 E 
L:2038 M:336 W 
L:2054 M:254 E 
L:2055 M:336 W 
L:2088 M:254 E 
M:254 Repeated 
L:2152 M:254 E: 
M:254 Repeated 
L:2198 M:336 W: 
L:2214 M:254 E: 
M:254 Repeated 
L:2303 M:254 E 
L:2304 M:336 W 
L:2352 M:254 E 
M:254 Repeated 
L:2374 M:254 E: 
M:254 Repeated 
L:2428 M:254 E: 
M:254 Repeated 
L:2491 M:254 E: 
M:254 Repeated 
L:2513 M:254 E: 
M:254 Repeated 
L: 2535 M:254 E: 
M:254 Repeated 
L:2542 M:336 W: 
L:2558 M:254 E: 
M:254 Repeated 
L:2631 M:254 E: 
M:254 Repeated 
L:2653 M:254 E: 
M:254 Repeated 
L:2675 M:254 E 
L:3255 M:336 W 
L:3421 M:336 W 
L: 3515 M: 336 W 
L:3547 M:336 W 
L:3579 M:336 W 
L:4534 M:336 W 
L:4608 M:336 W 
L:4682 M:336 W 
L:5487 M:336 W 
L: 5550 M: 336 W 
L:5579 M:336 W 
L:5608 M:336 W 



in SeqNo=21 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 
No. of Bases conflict, Input 
No. of Bases conflict, Input 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 

in SeqNo=30 
No. of Bases conflict, Input 

in SeqNo=31 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 

in SeqNo=32 
No. of Bases conflict, Input 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 

in SeqNo=4 0 
No. of Bases conflict, 

in SeqNo=41 
No. of Bases conflict, 

in SeqNo=44 
No. of Bases conflict, 

in SeqNo=4 7 
No. of Bases conflict, 

in SeqNo=48 
No. of Bases conflict, 

in SeqNo=4 9 
Invalid Amino Acid Number in 
No. of Bases conflict, Input 

in SeqNo=50 
No. of Bases conflict, 

in SeqNo=54 
No. of Bases conflict, 

in SeqNo=55 
No. of Bases conflict, 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 



Coding Region, SEQ ID: 21 
:0 Counted: 50 SEQ: 23 
:0 Counted: 50 SEQ: 24 
:0 Counted: 46 SEQ: 27 

Coding Region, SEQ ID: 27 
:0 Counted: 50 SEQ: 28 

Coding Region, SEQ ID: 28 
:0 Counted: 50 SEQ: 30 

:0 Counted: 50 SEQ: 31 

Coding Region, SEQ ID: 31 
:0 Counted: 50 SEQ: 32 

:0 Counted: 43 SEQ: 37 
Coding Region, SEQ ID: 37 
:0 Counted: 50 SEQ: 40 



Input :0 Counted: 50 SEQ: 41 
Input :0 Counted: 50 SEQ: 44 
Input :0 Counted: 50 SEQ: 47 
Input :0 Counted: 50 SEQ: 48 
Input :0 Counted: 50 SEQ: 49 



Coding Region, SEQ ID: 49 
:0 Counted: 50 SEQ: 50 



Input :0 Counted: 50 SEQ: 54 
Input :0 Counted: 50 SEQ: 55 



Input 
in 
in 
in 
in 
in 
in 
in 
in 
in 
in 



:0 Counted: 50 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 
Coding Region 



SEQ:56 

, SEQ ID: 70 

, SEQ ID: 76 

, SEQ ID: 79 

, SEQ ID: 80 

, SEQ ID: 81 

, SEQ ID: 92 

, SEQ ID: 93 

, SEQ ID: 94 

, SEQ ID: 121 

, SEQ ID:123 

, SEQ ID: 124 

, SEQ ID:125 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/940, 925 



DATE: 03/15/2002 
TIME: 14:41:46 



Input Set : A:\Seq-sub.app 

Output Set: N:\CRF3\03152002\I940925.raw 



L:5637 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 126 

L:5666 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 127 

L:5695 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 128 

L:5724 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 129 

L:5753 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 130 

L:5782 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 131 

L:5811 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 132 

L:6723 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 158 

L:6924 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 161 

L:7017 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:165 
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